An Integrated Environment for High-dimensional Geographic Data Mining
نویسندگان
چکیده
Introduction Geographic data are often very large in volume and “characterized by a high number of attributes or dimensions” [1]. There are urgent needs to develop effective and yet efficient approaches for analyzing such voluminous and high-dimensional data to address complex geographic problems [1, 2, 3, 4], e.g., detecting unknown multivariate patterns or relationships between socioeconomic, demographic, environmental factors and the incidence of various cancers. This paper introduces an integrated geographic data mining environment, which couples a suite of visualization and computational methods to explore multivariate patterns in large and high-dimensional geographic datasets. The integrated geographic data mining environment involves four major groups of components: (1) interactive feature selection components to identify interesting subsets of variables for further analysis[5]; (2) self-organizing map (SOM) [6] components to cluster data objects with only the variables selected above; (3) a high-dimensional visualization component—Parallel Coordinate Plot (PCP) [7]—to explore and present multivariate patterns or relationships; and (4) a geographic map component to visualize the spatial distribution of patterns discovered above. With interactive manipulation of these integrated components, the user can iteratively locate, interpret, and refine patterns.
منابع مشابه
Geostatistical Approaches for Geovisual Data Exploration, Analysis and 3D-Visualisation in Civil Security
This contribution presents selected approaches, methods and tools to facilitate geovisual analytical data exploration for civil security purposes. To analyse large emergency service data of a major German city’s fire department, different data mining techniques are applied. This allows identifying statistical significant clusters in space and time. To facilitate convenient methods for exploring...
متن کاملExtend Table Lens for High-Dimensional Data Visualization and Classification Mining
Data mining and information visualization apply different techniques to solve the same problem of extracting useful information hidden in large amount of data. In this project, we focus on classification problem and developed an integrated information visualization environment, which plugs in classification rule-mining methods and extends the Table Lens techniques to handle high-dimensional dat...
متن کاملGIS modelling for Au-Pb-Zn potential mapping in Torud-Chah Shirin area-Iran
One of the major strengths of a Geographic Information System (GIS) in geosciences is the ability to integrate and combine multiple layers into mineral potential maps showing areas which are favorable for mineral exploration. These capabilities make GIS an extremely useful tool for mineral exploration. Several spatial modeling techniques can be employed to produce potential maps. However, these...
متن کاملAn Integrated Baseline Geodatabase for Facilitating the Environmental Impact Assessment Process: Case Study of Sabalan Geothermal Project, Iran
Baseline data represent one of the important stages of Environmental Impact Assessment (EIA) procedure that describes the existing environment of the study area and surrounding areas in enough detail to allow the environmental impacts of the proposed area to be accurately and adequately assessed, and future changes and effects can be measured. Baseline data may be inaccurate, difficult to obtai...
متن کاملCalculation of One-dimensional Forward Modelling of Helicopter-borne Electromagnetic Data and a Sensitivity Matrix Using Fast Hankel Transforms
The helicopter-borne electromagnetic (HEM) frequency-domain exploration method is an airborne electromagnetic (AEM) technique that is widely used for vast and rough areas for resistivity imaging. The vast amount of digitized data flowing from the HEM method requires an efficient and accurate inversion algorithm. Generally, the inverse modelling of HEM data in the first step requires a precise a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004